AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Grouped query attention

# Grouped query attention

Mistral Nemo Base 2407 Chatml
Apache-2.0
Mistral-Nemo-Base-2407 is a 12-billion-parameter generative text pre-training model jointly trained by Mistral AI and NVIDIA, outperforming models of similar or smaller scale.
Large Language Model Transformers Supports Multiple Languages
M
IntervitensInc
191
3
Llama 3.1 70B
Meta Llama 3.1 is a large language model series supporting 8 languages, available in 8B/70B/405B scales, outperforming most open-source and proprietary chat models in industry benchmarks
Large Language Model Transformers Supports Multiple Languages
L
meta-llama
97.35k
358
Mistral 7B Instruct V0.1 Sharded
Apache-2.0
Mistral-7B-Instruct-v0.1 is an instruction fine-tuned version based on Mistral-7B-v0.1, suitable for dialogue generation tasks.
Large Language Model Transformers
M
filipealmeida
1,363
14
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase